Identification of Text on Colored Book and Journal Covers

Authors

  • Karin Sobottka
  • Horst Bunke
  • Heino Kronenberg
Abstract

In this paper an approach to automatic text location and identification on colored book and journal covers is proposed. To reduce the amount of small variations in color, a clustering algorithm is applied in a preprocessing step. Two methods have been developed for extracting text hypotheses. One is based on a top-down analysis using successive splitting of image regions. The other is a bottom-up region growing algorithm. The results of both methods are combined to robustly distinguish between text and non-text elements. Text elements are binarized using automatically extracted information about text color. The binarized text regions can be used as input for a conventional OCR module. Results are shown for several book and journal covers of different complexity. The proposed method is not restricted to book and journal cover pages, but can be applied to the extraction of text from other types of color images as well.
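The steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`quantize`, `grow_regions`, `binarize`), the fixed two-color palette, and the toy image are all assumptions made for illustration. The palette stands in for the output of the clustering step, which the abstract does not specify, and the top-down splitting method and the combination of both methods are not shown.

```python
from collections import deque

def quantize(image, palette):
    """Preprocessing stand-in: map each RGB pixel to its nearest palette color.
    The palette is assumed to come from some clustering step (hypothetical)."""
    def nearest(pixel):
        return min(range(len(palette)),
                   key=lambda i: sum((a - b) ** 2
                                     for a, b in zip(pixel, palette[i])))
    return [[nearest(p) for p in row] for row in image]

def grow_regions(labels):
    """Bottom-up region growing: 4-connected flood fill over equal color labels.
    Returns a list of (color_label, pixel_list) pairs."""
    h, w = len(labels), len(labels[0])
    seen = [[False] * w for _ in range(h)]
    regions = []
    for y in range(h):
        for x in range(w):
            if seen[y][x]:
                continue
            seen[y][x] = True
            queue, pixels = deque([(y, x)]), []
            while queue:
                cy, cx = queue.popleft()
                pixels.append((cy, cx))
                for ny, nx in ((cy - 1, cx), (cy + 1, cx),
                               (cy, cx - 1), (cy, cx + 1)):
                    if (0 <= ny < h and 0 <= nx < w and not seen[ny][nx]
                            and labels[ny][nx] == labels[y][x]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            regions.append((labels[y][x], pixels))
    return regions

def binarize(labels, text_label):
    """Mark pixels of the chosen text color as 1 and everything else as 0."""
    return [[1 if v == text_label else 0 for v in row] for row in labels]

# Toy 3x5 "cover": light background with two dark vertical strokes,
# each pixel carrying small color variations.
image = [
    [(250, 250, 250), (12, 8, 10), (248, 252, 249), (9, 11, 14), (251, 249, 250)],
    [(249, 251, 250), (10, 10, 10), (250, 250, 250), (11, 9, 12), (250, 248, 251)],
    [(250, 250, 250), (251, 250, 249), (250, 250, 250), (250, 251, 250), (249, 250, 250)],
]
palette = [(250, 250, 250), (10, 10, 10)]  # assumed cluster centers

labels = quantize(image, palette)
regions = grow_regions(labels)
mask = binarize(labels, text_label=1)  # text color index chosen by hand here
```

In this toy case the quantization removes the small color variations, region growing yields one background region and two stroke regions, and `mask` is the kind of binary image that could be passed to an OCR module. In the paper the text color is extracted automatically; here it is simply set by hand.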

Publication date: 1999